SSML Extensions Aimed To Improve Asian Language TTS Rendering

Authors

  • Jilei Tian
  • Xia Wang
  • Jani Nurminen
Abstract

Both formant-synthesis-based and concatenative acoustic-unit-based TTS systems have been developed at Nokia. Many non-English languages have been considered in the development work, and Nokia's Mandarin Chinese TTS system is under continuous development within the TC-STAR framework (www.tc-star.org). To meet the needs of the TTS evaluations in TC-STAR, common interfaces for the input and all the internal modules have been carefully defined. SSML has been adopted as the input format, and Nokia has proposed extensions that address the peculiarities of Asian languages.

Similar resources

Implementing an SSML compliant concatenative TTS system

The W3C Speech Synthesis Markup Language (SSML) unifies a number of recent related markup languages that have emerged to fill the perceived need for increased, and standardized, user control over Text to Speech (TTS) engines. One of the main drivers for markup has been the increasing use of TTS engines as embedded components of specific applications – which means they are in a position to take ...

Multilayered extensions to the speech synthesis markup language for describing expressiveness

In this paper we discuss possible extensions to the Speech Synthesis Markup Language (SSML) to facilitate the generation of synthetic expressive speech. The proposed extensions are hierarchical in nature, allowing specification in terms of physical parameters such as instantaneous pitch, higher-level parameters such as ToBI labels, or abstract concepts such as emotions. Low-level tags tend to c...

A Corpus-based Approach to <ahem/> Expressive Speech Synthesis

Human speech communication can be thought of as comprising two channels – the words themselves, and the style in which they are spoken. Each of these channels carries information. Today's most-advanced text-to-speech (TTS) systems such as [1],[2],[3],[4] fall far short of human speech because they offer only a single, fixed style of delivery, independent of the message. In this paper, we descri...

SSML Goes International – A Standard Story

Since September 2004, the SSML 1.0 [1] specification has been a W3C Recommendation. SSML is the standard way that a Voice Browser controls a speech synthesis engine. Given that it is a standard, actions to define the language of the text to be rendered, to change between several voices, to insert pauses, to perform simple text normalization (e.g. acronym expansions, such as reading W3C as “World ...

Towards Synthesis of Focus in Mandarin Text-to-speech System

This paper introduces the significance of focus synthesis in Mandarin text-to-speech (TTS) systems, as well as the key challenges in research on focus synthesis. An extension to the Speech Synthesis Markup Language (SSML) is proposed to improve the intelligibility of key words or phrases, and is demonstrated with an example.

Publication date: 2005